Towards a Better Understanding of Predict and Count Models

نویسندگان

  • S. Sathiya Keerthi
  • Tobias Schnabel
  • Rajiv Khanna
چکیده

In a recent paper, Levy and Goldberg [2] pointed out an interesting connection between prediction-based word embedding models and count models based on pointwise mutual information. Under certain conditions, they showed that both models end up optimizing equivalent objective functions. This paper explores this connection in more detail and lays out the factors leading to differences between these models. We find that the most relevant differences from an optimization perspective are (i) predict models work in a low dimensional space where embedding vectors can interact heavily; (ii) since predict models have fewer parameters, they are less prone to overfitting. Motivated by the insight of our analysis, we show how count models can be regularized in a principled manner and provide closed-form solutions for L1 and L2 regularization. Finally, we propose a new embedding model with a convex objective and the additional benefit of being intelligible.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

I-34: NRY Haplotype Analysis: towards A Better Understanding of The Genetic Basis of Spermatogenic Failure

It has been established that the Y chromosome carries genes required for spermatogenesis and male fertility. For many decades worldwide screening for gene identification has been conducted in research laboratories. However, it has been a difficult process in identifying such genes (i.e. causative mutations) which could explain the phenotypic variation and could be potentially used as markers fo...

متن کامل

PREDICTING CLUSTER B PERSONALITY DISORDER ACCORDING TO FIVE FACTOR ALTERNATIVE MODELS ZUCKERMAN- KUHLMAN AND EGO STRENGTH

Abstract  Background& Aims:Due to the wide range of personality disorders and as well as alternative model DSM-5 for personality disorders, this study aimed to cluster B personality disorder according to five factor alternative models Zuckerman- Kuhlman and ego strength. Method:The study population is included all students of University of MohegheghArdabili in 2015(N=14000). A descriptive...

متن کامل

Prediction of fragmentation due to blasting using mutual information and rock engineering system; case study: Meydook copper mine

One of the key outcomes of blasting in mines is found to be rock fragmentation which profoundly affects downstream expenses. In fact, size prediction of rock fragmentation is the first leap towards the optimization of blasting design parameters. This paper makes an attempt to present a model to predict rock fragmentation using Mutual Information (MI) in Meydook copper mine. Ten parameters are c...

متن کامل

A closer look at rock physics models and their assisted interpretation in seismic exploration

Subsurface rocks and their fluid content along with their architecture affect reflected seismic waves through variations in their travel time, reflection amplitude, and phase within the field of exploration seismology. The combined effects of these factors make subsurface interpretation by using reflection waves very difficult. Therefore, assistance from other subsurface disciplines is needed i...

متن کامل

Rereading the Bystrom and Jarvelin's Information Seeking Behavior Model: Can the Scope of this Model Be Criticized?

Background and aim: Information seeking behaviors are the reflection of users' needs that Identifying and understanding them correctly is imperative in information seeking endeavors. Experts have presented cognitive and Process user-oriented approach models to better understand scholars’ information seeking behaviors.  The intent of models are to define and clarify the conditions that predict p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1511.02024  شماره 

صفحات  -

تاریخ انتشار 2015